75 results found.
Speech
Corpus,
Language Type:
Multilingual
Languages:
Amharic Bosnian Croatian Dari English French Georgian Haitian Hausa Hindi Korean Mandarin Chinese Persian Portuguese Pushto Russian Spanish Turkish Ukrainian Urdu Vietnamese Yue Chinese
Availability:
From Owner
License:
LDC
Size:
215 hours
Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Modeling and training strategies for language recognition systems
-
Paper track:4.1 Language identification and verification, lang/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2009 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Turkish
Availability:
From Owner
License:
LDC
Size:
200 hours
Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Modeling and training strategies for language recognition systems
-
Paper track:4.1 Language identification and verification, lang/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | IARPA Babel Turkish Language Pack | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Cantonese English French German Gishu Greek Gujarati Hebrew Hindi Indonesian Japanese Korean Mandarin Persian Portuguese Runyankore Russian Spanish Turkish Vietnamese
Availability:
Freely Available
License:
OpenSource
Size:
22.8 GByte
Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:Speaking rate, information density, and information rate in first-language and second-language speech
-
Paper track:1.10 Bilingual and L2 acquisition and processing/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ann Bradlow | The ALLSSTAR Corpus | /N |
Documentation:
Documentation in English is available to the public (via the project website)
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Albanian Croatian English German Russian Turkish
Availability:
Freely Available
License:
Size:
10 MByte
Production Status:
Existing-updated
Use:
Document Classification, Text categorisation
-
Paper title:XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mladen Karan | XHate-999 | /N |
Documentation:
An accompanying paper details the dataset creation, and a short readme with technical details accompanies the dataset.
Not Applicable
Tagger/Parser,
Language Type:
Multilingual
Languages:
Czech English German Spanish Turkish
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Víctor M. Sánchez-Cartagena | Stanford's NLP Parser | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English Turkish
Availability:
Freely Available
License:
Size:
207678 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Víctor M. Sánchez-Cartagena | SETIMES | /N |
Documentation:
None
Language Type:
Multilingual
Languages:
English German Portuguese Russian Turkish
Availability:
Not Available
License:
-
Size:
38000 words
Production Status:
Newly created-in progress
Use:
Discourse
-
Paper title:Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Deniz Zeyrek | Middle East Technical University | TR |
| Author 2 | Amália Mendes | Centre for Linguistics of the University of Lisbon | PT |
| Author 3 | Murathan Kurfalı | Middle East Technical University | TR |
| Main Contact | Deniz Zeyrek | Middle East Technical University | None |
Documentation:
An annotation manual in English exists. It is currently available only to the annotators.
Language Type:
Multilingual
Languages:
Turkish
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Newly created-in progress
Use:
Natural Language Processing
-
Paper title:Turkish Paraphrase Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Seniz Demir | <Not Specified> | None |
| Author 2 | Ilknur Durgar El-Kahlout | <Not Specified> | None |
| Author 3 | Erdem Unal | <Not Specified> | None |
| Author 4 | Hamza Kaya | <Not Specified> | None |
| Main Contact | Seniz Demir | TUBITAK-BILGEM | TR |
Documentation:
English, Available only from the owner
Written
Corpus,
Language Type:
Multilingual
Languages:
German Turkish
Availability:
Freely Available
License:
<Not Specified>
Size:
1029 tweets Other
Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:A Turkish-German Code-Switching Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Özlem Çetinoğlu | IMS, University of Stuttgart | DE |
| Main Contact | Özlem Çetinoğlu | IMS, University of Stuttgart | None |
Documentation:
<Not Specified>
<Not Specified>
Corpus,
Language Type:
Multilingual
Languages:
Turkish Uzbek
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Bitext Name Tagging for Cross-lingual Entity Annotation Projection
-
Paper track:Under-resourced Languages
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dongxu Zhang | Beijing University of Posts and Telecommunications | CN |
| Author 2 | Boliang Zhang | Rensselaer Polytechnic Institute | US |
| Author 3 | Xiaoman Pan | Rensselaer Polytechnic Institute | US |
| Author 4 | Xiaocheng Feng | Harbin Institute of Technology, SCIR lab | CN |
| Author 5 | Heng Ji | Rensselaer Polytechnic Institute | US |
| Author 6 | Weiran XU | Beijing University of Posts and Telecommunications | CN |
| Main Contact | Dongxu Zhang | Beijing University of Posts and Telecommunications | None |
Documentation:
<Not Specified>




